Training and Testing Low-degree Polynomial Data Mappings via Linear SVM

نویسندگان

  • Yin-Wen Chang
  • Cho-Jui Hsieh
  • Kai-Wei Chang
  • Michael Ringgaard
  • Chih-Jen Lin
چکیده

Kernel techniques have long been used in SVM to handle linearly inseparable problems by transforming data to a high dimensional space, but training and testing large data sets is often time consuming. In contrast, we can efficiently train and test much larger data sets using linear SVM without kernels. In this work, we apply fast linear-SVM methods to the explicit form of polynomially mapped data and investigate implementation issues. The approach enjoys fast training and testing, but may sometimes achieve accuracy close to that of using highly nonlinear kernels. Empirical experiments show that the proposed method is useful for certain large-scale data sets. We successfully apply the proposed method to a natural language processing (NLP) application by improving the testing accuracy under some training/testing speed requirements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low-degree Polynomial Mapping of Data for SVM

Non-linear mapping functions have long been used in SVM to transform data into a higher dimensional space, allowing the classifier to separate linearly inseparable data. Kernel tricks are used to handle the huge number of features of the mapped data point. However, the training/testing for large data is often time consuming. Following the recent advances in training large linear SVM (i.e., SVM ...

متن کامل

Evaluation of Sentinel-1 Interferometric SAR Coherence efficiency for Land Cover Mapping

In this study, the capabilities of Interferometric Synthetic Aperture Radar (InSAR) time series data and machine learning have been evaluated for land cover mapping in Iran. In this way, a time series of Sentinel-1 SAR data (including 16 SLC images with approximately 24 days time interval) from 2018 to 2020 were used for a region of Ahvaz County located in Khuzestan province. Using InSAR proces...

متن کامل

An Approximate Approach for Training Polynomial Kernel SVMs in Linear Time

Kernel methods such as support vector machines (SVMs) have attracted a great deal of popularity in the machine learning and natural language processing (NLP) communities. Polynomial kernel SVMs showed very competitive accuracy in many NLP problems, like part-of-speech tagging and chunking. However, these methods are usually too inefficient to be applied to large dataset and real time purpose. I...

متن کامل

Support vector machine classification for large datasets using decision tree and Fisher linear discriminant

The training of a support vector machine (SVM) has a time complexity between O(n) and O(n). Most training algorithms for SVM are not suitable for large data sets. Decision trees can simplify SVM training, however the classification accuracy becomes lower when there are inseparable points. This paper introduces a novel method for SVM classification. A decision tree is used to detect low entropy ...

متن کامل

کاربرد ماشین بردار پشتیبان در طبقه‌بندی کاربری اراضی حوزه چشمه کیله- چالکرود

Classification of land use extraction always been one of the most important applications of remote sensing and why different methods are created. Over time and with greater accuracy were developed more advanced methods that increase the accuracy and the extraction classes that were closer together in terms of quality are better. SVM is one of these methods in the study of this method for the ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Machine Learning Research

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2010